Conformational transitions in random heteropolymer models 
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We study the conformational transitions in heteropolymers within the frames of a lattice model 
containing two types of monomers A and B. Such a model can describe in particular the sequences of 
■ hydrophobic and hydrophilic residues in proteins (K.F. Lau and K.A. Dill, Macromolecules 22, 3986 

(1989)) and polyampholytes with oppositely charged groups (Y. Kantor and M. Kardar, Europhys. 
Lett. 28, 169 (1994)). We generalize this lattice model by introducing various types of short-range 
£SJ ■ monomer-monomer interactions. Applying the pruned-enriched Rosenbluth chain-growth algorithm 

(PERM) we analyze numerically the peculiarities of transitions from extended into compact states 
1 as function of the fraction of A and B monomers along the heteropolymer chain. 
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Ch 1 I. INTRODUCTION 

A subject of great interest in both chemical and biological physics are the conformational properties of long het- 
eropolymer chains. Typical examples are proteins, consisting of sequences of amino acid residues, connected by 
peptide bonds. The conformations of individual macromolecules are controlled by the type of monomer- monomer 
interactions. In general, the constituents (monomers) of macromolecules in an aqueous environment can be char- 
acterized as hydrophilic or hydrophobic, depending on their chemical structure. Hydrophilic residues tend to form 
hydrogen bonds with surrounding water molecules, whereas the hydrophobic monomers effectively attract each other 
and tend to form a dense hydrophobic core. For a flexible polymer chain in an athermal solvent, the steric hard-core 
£j repulsions between monomers are dominant, and a typical polymer chain usually attains a coil-like conformation. In 
— - J < this regime, the size measures of a typical TV- monomer chain (such as the mean-square end-to-end distance (i? 2 ) or 
the mean-square radius of gyration (i? 2 )) obey scaling laws [l|-y|: 



(Rl) ~ (Rl) ~ iV 2 — 



>: 

oo. 

■ 

with a universal exponent v co n — 0.587597(7) in three dimensions 4]. At temperatures below the so-called O- 
temperature, growing effect of hydrophobic monomers attraction makes the polymer radius shrink, which causes the 
. collapse transition into the compact globule-like regime. Exactly at the ©-temperature, the macromolecules behave like 
^-H 1 an ideal Gaussian chain (up to logarithmic corrections) with size exponent v® = 1/2. The conformational properties of 
fNJ ■ such heteropolymers can be successfully studied within the frames of a lattice model, where each monomer is treated 
as a point particle on a regular lattice. For simplicity, it was proposed to consider lattice polymers which contain only 
two types of monomers, say A (hydrophobic) and B (hydrophilic), the so-called HP model [5]. The conformational 
state of such a model is defined by the short-range nearest-neighbour attraction between any pair of (non-bonded) 
A-monomers along the SAW trajectory, and thus depends on the fraction of residues of the two types and on the 
sequence of residues along the chain. In further generalizations of this model the contact interaction is replaced by 
a distance-dependent Lennard- Jones type of potential, being attractive for AA and BB pairs and repulsive for AB 
pair of monomers [6(. Considerable attention of researchers has been paid to identifying a compact globular state 
(associated with the global energy minimum) of a chain with fixed A and B sequence (describing proteins with a 
given sequence of amino acid residues) 043 • 

On the other hand, the long-range nature of the electrostatic Coulomb interaction between charged groups also 
produces crucial effects on heteropolymer conformations. Of particular interest are so-called polyampholytes (PAs) [lil . 
15| : heteropolymers, comprising both positively and negatively charged monomers. Examples of polyampholytes are 
synthetic copolymers bearing acidic and basic repeat groups. It is established, that PAs possess a rich conformational 
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behaviour, strongly dependent both on their overall charge Q and the quality of solvent (and thus on temperature 
T) [lll - t^ij ]. If positive and negative charges are almost balanced (Q is small), the attractive Coulomb interaction 
dominates, and the polymer collapses into a globular (sphere-like) state at low temperatures. If the energy of the 
electrostatic interactions prevails the globular surface energy, which happens when the excess charge Q approaches 
Q c ~ N 1 / 2 , molecules split into extended necklace-like sequences of small connected globules. In the high-temperature 
regime but small Q (predominant electrostatic attraction), the PA forms globule-like structures, whereas at the critical 
value of the excess charge Q c ~ N 1 / 2 the charged polymer attains the shape of a neutral polymer coil. Polyampholytes 
with considerable disbalance between positive and negative charges (Q > Q c ) and thus predominant Coulomb repulsion 
are expected to attain a stretched rod-like configuration. A particular model of PA with strongly screened short-range 
Coulomb interactions (as proposed in Ref. (25[) appears to be in a close relationship with the HP model as described 
above; really, in this case one has mutual repulsion between AA and BB pairs and attraction for AB pairs. 

In the present work, we aim to study several generalizations of a lattice heteropolymer model with various types 
of monomer- monomer interactions. In particular, this allows us to compare the properties of HP model and model 
of a polyampholyte chain with strongly-screened Coulomb interaction directly within the same approach. The main 
accent will be made on the peculiarities of conformational transitions from extended into compact state as function 
of the fraction of A and B monomers along the heteropolymer chain. 

The layout of the rest of the paper is as follows: in the next Section II we introduce the model and briefly describe 
the pruned-enriched Rosenbluth method, applied in our study. The results obtained are given in Section III. We end 
up by giving conclusions and an outlook in Section IV. 



II. MODEL AND METHOD 



We start with a lattice model of a linear polymer chain with Na monomers of type A and Nb = N — Na monomers 
of type B in d = 3 dimensions. Denoting by e an interaction energy of any pair of monomers residing on neighboring 
lattice sites that are not connected by a covalent bond, we consider 5 different cases: 



Model 1 
Model 2 
Model 3 
Model 4 
Model 5 



£AA = £BB = 1,£AB = -1, (1) 

£AA = ebb = —1,£AB = 1) (2) 

£AA = 1, EBB = SAB = 0, (3) 

£AA = -1, £BB = £AB = 0, (4) 

EAA = -1,EBB = }-,ZAB = 0. (5) 



The model 1, where the like monomers re pel and the opposite ones attract, refers to strongly screened Coulomb 
interactions, as studied previously in Ref. [25|]. The model 3 is a particular case of model 1 and corresponds to a 
polymer chain containing charged (A) and neutral (B) monomers. Model 4 refers to the (minimal) HP model [5[ with 
hydrophobic (A) and hydrophilic (B) monomers. Cases 2 and 5 can be considered as generalizations of the two above 
mentioned models. 

Let us introduce the inhomogeneity ratio: 

so that the cases c = 1 (c = 0) correspond to homogeneous chains with all monomers of type A (B). Note also, that 
models 1 and 2 are "symmetrical" in an energetical sense with respect to the value c = 0.5. 

To study the conformational properties of models 1-5, we apply the pruned-enriched Rosenbluth method (PERM) 
(26j . The chain grows step by step, i.e., the nth monomer is placed at a randomly chosen neighbour site of the last 
placed in — l)th monomer (n < N). The growth is stopped, if the total length of the chain, N, is reached (we consider 
chains of length up to N = 100). The total energy of a chain E(N) is given by 

N N 

^)=EE^. (7) 

i=l j=i+2 

where each i and j is either of the type A or B, correspondingly is 1, —1 or depending on the model, and cr^ = 1 
if monomers i and j are nearest neighbours and zero otherwise. The Rosenbluth weight factor W n is thus taken to be 



W n =J[mie ; (g) 

1=2 
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where m; is the number of free lattice sites to place the Zth monomer, E(l) denotes the energy of the /-step chain, 
and kg is the Boltzmann constant. In what follows, we will assume units in which kg = 1. 

Pruning and enrichment are performed by choosing thresholds and depending on the current estimate of 
the sum of weights Z n — X) C onf °f * ne rt- monomer chain [O, Ha, H3] • ^ the current weight W n of an n- monomer 

chain is less than W<, the chain is discarded with probability 1/2, whereas if W n exceeds W> , the configuration is 
doubled (enrichment of the sample with high- weight configurations) . 

When studying random hereropolymers, a double averaging has to be performed to receive the quantitative value 
for any observable. At first, the conformational averaging is performed over an ensemble of possible conformations of 
a macromolecule with a fixed sequence of A and B monomers under a given inhomogeneity ratio c value along the 
chain: 

Z^conf VV N 

where is the weight of an iV-monomer chain in a given conformation. The second is the "sequence average" 

(...), which is carried out over different random sequences of A and B monomers under given c value: 

i M 

<°> = mE<°>'- ( 10 ) 

i=l 

Here, M is the number of different sequences and the index i means that a given quantity is calculated for sequence i. 
We consider up to M = 1000 random sequences at given inhomogeneity ratio value. Note, that the case of so-called 
"quenched disorder" is considered, where the average over different sequences is taken after the configurational average 
has been performed. 



III. RESULTS 



The averaged energy of heteropolymer models 1-5 can be predicted based on the following considerations. Let us 
introduce the averaged number of nearest-neighbour contacts p for any monomer in a chain, which can be estimated 
from the empirical relation: 



V 



2d-l 



fill 



where z is the fugacity (connectivity constant) of the polymer chain. Having a heterogeneous polymer chain with 
inhomogeneity ratio c (so that Na = N ■ c, Nb = N ■ (1 — c)), we can estimate the number of A A nearest-neighbour 
contacts: 



1 1 2 



(12) 
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FIG. 1: The averaged energy per monomer of an N = 100-monomer heterogeneous chain in the high-temperature regime 
with various types of intramonomer interaction as function of c. Lines are the corresponding results of empirical estimations 
(fT5l>-([T9l>. 
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FIG. 2: The averaged energy per monomer of an N = 100-monomer heterogeneous chain with various types of intramonomer 
interaction as function of c at temperature T = 4.0. Lines are guides to the eye. 



the number of BB nearest-neighbour contacts: 

n BB = ^N BP (l -c) = -JVp(l - c) 2 
and the number of AB nearest-neighbour contacts: 

n A B = N A p{l - c) = Npc(l - c). 
Thus, for the total energies of models 1-5 we have correspondingly: 



^Modcl 1 
^Modol2 



TIaa + n BB - n AB = 2Np c - c + 



-uaa - n BB + n AB 



1 



-2Np [<? - c ■ 



E 



Model 3 



n A A = -j^Npc , 



E 



Model 4 



-nAA 



E 



Model 5 



-UAA + n BB = Np | - - r 



(13) 
(14) 

(15) 

(16) 

(17) 
(18) 
(19) 



In the high-temperature regime, the steric self-avoidance effect between monomers plays the only role in determining 
the conformational properties of polymers. Thus taken the known value z—4. 68404(9) 28] (which gives p = 0.31596(9) 
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FIG. 3: Averaged energy per one monomer of the heterogeneous polymer chain with various types of monomer-monomer 
interactions as function of temperature at fixed inhomogeneity ratio c = 0.7 and c = 0.2. 



1.5 
1 

0.5 




0.5 1 



1.5 



2 

T 



c=0.7 
Model 1 ■ 
Model 2 o 
Model 3 * 
Model 4 D 
Model 5 ♦ 

2.5 3 3.5 



1.5 

1 - 
0.5 - 
0- 



0.5 1 1.5 



2 

T 



c=0.2 
Model 1 ■ 
Model 2 * 
Model 3 " 
Model 4 ° 
Model 5 ♦ 

2.5 3 3.5 



FIG. 4: Averaged end-to-end distance of the heterogeneous polymer chain with various types of monomer-monomer interactions 
as function of temperature at fixed inhomogeneity ratio c = 0.7 and c = 0.2. 



according to (|11[) ). we can obtain numerical estimates for (I15l) - (|19p . These expressions are perfectly confirmed by our 
numerical simulations (see Fig. Q]), where the corresponding energies per monomer are presented. 

With lowering the temperature, the interactions between monomers become essential, and the p-values for each 
model appear to be dependent on the inhomogeneity ratio c. As an example, we present our simulation results for 
corresponding energies per monomer at T = 4.0 in Fig. [2] Though for all models the polymer chains are expected 
to be still in an extended state (coil regime) at this temperature, the typical configurations they attain at given 
inhomogeneity ratio are governed by that nearest-neighbour contact value which minimizes the total energy. Indeed, 
for model 3 the most preferable configurations would be those with smaller value of nearest-neighbour contacts, 
whereas for model 4 the larger possible value of p will minimize the total energy. The values of p for all the models 
at various c- values can be estimated on the basis of data presented in Fig. [5] applying expressions (fl5")) - (lll?)) . 

In Fig. [3] we present our results for the averaged energies of the N = 100-monomer heterogeneous chain as function 
of temperature at fixed inhomogeneity ratios c = 0.7 and c = 0.2. For c = 0.7, the energy of the models 1, 2, 4 
and 5 decreases monotonically as temperature is lowered and sharply falls down within a short temperature interval, 
indicating the transition from extended state (coil regime) to compact state. For model 3, however, one observes 
only a monotonic decrease of energy, which gradually tends to zero with lowering the temperature, and thus no 
transition occurs. Moreover, within the frames of model 3, decreasing the energy means decreasing the averaged 
number of nearest-neighbour contacts, and thus expanding the chain. Thus, unlike the other models, in this case the 




T 



FIG. 5: Averaged end-to end distance of a heterogeneous polymer chain divided by number of monomers as function of 
temperature at inhomogeneity ratio c = 0.7 at various values of N from 68 up to 100. 
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Phase diagrams of heterogeneous polymer chain in T-c space, (a) model 1, (b) model 2, (c) model 4, (d) model 5. 



averaged size of the heteropolymer chain increases with lowering the temperature. Similar results are found at fixed 
inhomogeneity ratio c = 0.2. In this case, the model 5 consists mainly of monomers, which repel each other, and, like 
in model 3, conformations with a minimum number of nearest-neighbour contacts are energetically preferable, which 
leads to expanding the chain size with lowering the temperature. Note also that at small concentration of attractive 
monomers, the chains can attain the compact state only when they are long enough and have enough attractive 
nearest-neighbour contacts to overcome the conformational entropy. 

In Fig. [Jour results for the averaged end-to-end distance of the N = 100-monomer heterogeneous chain as function 
of temperature are given at the same two fixed values of inhomogeneity ratio c = 0.7 and c = 0.2. These results 
confirm our discussions above. For the case c = 0.7, one notices a decrease of the polymer chain size with lowering 
the temperature (models 1, 2, 4 and 5), which becomes crucial close to the transition temperature into the compact 
state. However, in model 3 the polymer chain is expanding its size with lowering the temperature due to repulsion 
between monomers, and the polymer chain remains in an extended state at any temperature. For c = 0.2, the size 
of polymer chain of model 5 increases even stronger than that of model 3 due to the considerably larger amount of 
mutually repelling monomers; in both cases the polymer chain remains in an extended state at any temperature. 

To obtain a quantitative description of conformational transitions from extended to compact state in models 1-5, we 
study the ratio (Rl) /N for various N and fixed inhomogeneity ratios c as function of temperature (Fig. [5J. Assuming 
that the collapse of heteropolymers is through the 0-transition similar to homogeneous polymers, we expect (R^) ~ N 
at the 0-point, and thus the curves intersect in the vicinity of the transition temperature (the larger N, the steeper 
the curves). Despite the slight dependence of the intersection points on N, we can locate the crossover temperature. 
If at some T and c values the curves do not intersect, there is no phase transition and the polymer is in an extended 
state at any temperature. 

As a result, we obtain the phase diagrams, presented in Fig. [5] Note, that within the frames of model 3 the 
polymer chain is always in an extended state at any temperature and inhomogeneity ratio value, thus we do not 
construct any phase diagram for this model. Models 2, 4 and 5 describe homogeneous polymer chains with nearest- 
neighbour attractions in the limiting case c = 1 (for model 2 also c = 0) with known value of transition temperature 
T® = 3.717(3) [26j]. In model 2, the 0-transition is present always at any value of c, whereas models 1, 4 and 5 
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remain in the extended state when the concentration of attracting monomers is too small to cause a transition into 
the compact state. The results for the case of model 1 are in a good correspondence with those obtained previously 
in Ref. EH- 



IV. CONCLUSIONS 

We studied the conformational properties of hetcropolymers within the frames of a lattice model containing Na 
monomers of type A and Nb — N — Na monomers of type B in d — 3 dimensions. Restricting ourselves only to 
short-range interactions between any pair of monomers residing on neighboring lattice sites that are not connected 
by a covalent bond, we consider 5 different generalizations of this model (JT|)— (JSJ) - In particular, the model 1 where 
the like monomers repel and the opposite ones attract, refers to strongly screened Coulomb interactions, as studied 
previously in Ref. [25j. The model 3 is a particular case of model 1 and corresponds to a polymer chain containing 
charged (A) and neutral (B) monomers. Model 4 refers to the (minimal) HP model [|[ with hydrophobic (A) and 
hydrophilic (B) monomers. Cases 2 and 5 can be considered as generalizations of the two above mentioned models. 

Applying the pruned-enriched Rosenbluth chain-growth algorithm (PERM) we analyze numerically the peculiarities 
of transitions from extended into compact state as function of inhomogeneity ratio c = Na/N for all the heteropolymer 
chain models. Within the frames of model 3, unlike the other models, the polymer chain is expanding its size with 
lowering the temperature due to repulsion between monomers, and the polymer chain remains in an extended state 
at any temperature. In model 2, the O-transition is present always at any value of inhomogeneity ratio c, whereas 
models 1, 4 and 5 remain in an extended state when the concentration of attracting monomers is too small to cause 
a transition into the compact state. Note also that at small concentration of attractive monomers, the chains can 
attain the compact state only when they are long enough and have enough attractive nearest-neighbour contacts 
to overcome the conformational entropy. Models 2, 4 and 5 describe homogeneous polymer chains with nearest- 
neighbour attractions in the limiting case c = 1 (for model 2 also c = 0) with known value of transition temperature 
T = 3.717(3) HI. 
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